Enhancing GPU Performance for SHA-3 Algorithm: Optimizing Hashing Operations in a Parallel Computing Environment
نویسندگان
چکیده
Software implementation of Hash function have not been able to offer satisfactory performances for various application thus far. Additionally, SHA-3 and SHAKE, which utilize SHA-3, are extensively utilized in many Post Quantum Cryptosystem (PQC) . Therefore, there is a need research optimize the software environments. Our proposal involves an optimized on GPU environment. To improve performance efficiency, we suggest techniques such as optimizing internal processes inline PTX optimization, efficient memory usage, asynchronous CUDA stream application. After implementing these optimization methods, our SHA-3(512) (and SHA-3(256)) algorithm provides maximum throughput 88.51 Gb/s 171.62 Gb/s) RTX2080Ti without stream. The aims environment enhance efficiency. suggested include process applying SHA-3(256) algorithms provide Gb/s, respectively,
منابع مشابه
Optimizing Parallel Prefix Operations for the Fermi Architecture 3
The NVIDIA Fermi GPU architecture introduces new instructions designed to facilitate basic, but important, parallel primitives on per-thread predicates, as well as instructions for manipulating and querying bits within a word. This chapter demonstrates the application of these instructions in the construction of efficient parallel algorithm primitives such as reductions, scans, and segmented sc...
متن کاملParallel and Improved PageRank Algorithm for GPU-CPU Collaborative Environment
The internet is a huge collection of websites in the order of 10 bytes. Around 90% of the world’s population uses search engines for getting relevant information. According to Wikipedia, more than 200 million Indians use the Internet every day. Thus the correct data retrieval least time domain is the most important task. Hence need of efficient and parallel PageRanking algorithm. All the existi...
متن کاملAn Improved Image Segmentation Algorithm Based on GPU Parallel Computing
In the process of image segmentation, the classic Fuzzy C-Means (FCM) algorithm is time-consuming and depends heavily on initialization center. Based on Graphic Processing Unit (GPU), this paper proposes a novel FCM algorithm by improving the computational formulas of membership degree and the update criterion of cluster centers. Our algorithm can initialize cluster centers purposefully and fur...
متن کاملGenetic Algorithm Modeling with GPU Parallel Computing Technology
We present a multi-purpose genetic algorithm, designed and implemented with GPGPU / CUDA parallel computing technology. The model was derived from a multi-core CPU serial implementation, named GAME, already scientifically successfully tested and validated on astrophysical massive data classification problems, through a web application resource (DAMEWARE), specialized in data mining based on Mac...
متن کاملPerformance Analysis of Parallel Sorting Algorithms using GPU Computing
Sorting is a well interrogating issue in computer science. Many authors have invented numerous sorting algorithms on CPU (Central Processing Unit). In today's life sorting on the CPU is not so efficient. To get the efficient sorting parallelization should be done. There are many ways of parallelization of sorting but at the present time GPU (Graphics Processing Unit) computing is the most ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Indian Scientific Journal Of Research In Engineering And Management
سال: 2023
ISSN: ['2582-3930']
DOI: https://doi.org/10.55041/ijsrem24433